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Monozygotic twins affected witli major depressive 
disorder liave greater variance in methylation tlian 
tlieir unaffected co-twin 

EM Byrne^'", T Carrillo-Roa^ ^ AK Menders^ L Bowdler^ AF McRae^ ^ AC Heath^ NG Martin^ GW Montgomery^ L Krause^'^ and 
NR Wray^'^ 

Our understanding of major depressive disorder (IVIDD) has focused on the influence of genetic variation and environmental risk 
factors. Growing evidence suggests the additional role of epigenetic mechanisms influencing susceptibility for complex traits. 
DNA sequence within discordant monozygotic twin (MZT) pairs is virtually identical; thus, they represent a powerful design for 
studying the contribution of epigenetic factors to disease liability. The aim of this study was to investigate whether specific 
methylation profiles in white blood cells could contribute to the aetiology of MDD. Participants were drawn from the Queensland 
Twin Registry and comprised 12 MZT pairs discordant for MDD and 12 MZT pairs concordant for no MDD and low neuroticism. 
Bisulphite treatment and genome-wide interrogation of differentially methylated CpG sites using the lllumina Human Methylation 
450 BeadChip were performed in WBC-derived DNA. No overall difference in mean global methylation between cases and their 
unaffected co-twins was found; however, the differences in females was significant (P= 0.005). The difference in variance across 
all probes between affected and unaffected twins was highly significant {P<2.2 x 10 ~^^), with 52.4% of probes having higher 
variance in cases (binomial P-value<2.2 x 10 '^^). No significant differences in methylation were observed between discordant 
MZT pairs and their matched concordant MZT (permutation minimum P= 0.11) at any individual probe. Larger samples are likely 
to be needed to identify true associations between methylation differences at specific CpG sites. 
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Introduction 

Major depressive disorder (IVIDD) is a complex disorder with a 
considerable impact on the quality of patient's lives^ including 
an increased rate of mortality. ^'^ Lifetime prevalence is 
estimated to be 15% and it is twice more common in women 
than in men.^'"* Heritability of MDD is estimated to be ~40%.^ 
There have been no common genetic variants convincingly 
associated with MDD,® and most of the variants identified for 
complex psychiatric disorders contribute to a small fraction of 
the genetic variation and explain only a small proportion of 
the heritability.'' Growing evidence suggests that variations in 
epigenetic profiles (that is, DNA methylation) influence 
complex trait variation.® Epigenetic modifications do not 
directly interfere with the gene transcript but control tissue 
and temporal specificity of gene expression. DNA methy- 
lation is the most common epigenetic variation in the 
mammalian genome® and is not entirely stable; stochastic 
and environmental events can also generate variations over 
time.^^ Non-heritable influences on depression may mediate 
their biological effects through epigenetic mechanisms such 
as methylation of CpG dinucleotides. 

Several epigenetic studies have attempted to identify 
differences in DNA methylation associated with complex 
traits. Associations between variations in DNA methylation 



profiles and various psychiatric disorders (for example, 
schizophrenia, MDD, bipolar disorder and so on) have been 
reported. Some limitations should be considered from 
these studies (for example, small samples sizes, sample 
heterogeneity, epigenome coverage). Tissues involved in 
complex psychiatric disorders are not directly accessible from 
living patients; epigenetic studies often rely on peripheral 
tissue biomarkers such as buccal, gut and white blood cells 
(WBC).^® Post-mortem tissue can also be used in epigenetic 
studies to compare central and peripheral tissue;^® however, 
samples are not easily accessible. Moreover, post-mortem 
epigenetic measures may reflect treatment rather than 
disease. More research is needed to determine the accuracy 
of peripheral tissue biomarkers.''^ 

Discordant monozygotic twin (MZT) pairs constitute a 
powerful design in epigenetic studies. Single-nucleotide 
polymorphisms and other DNA sequence variations, which 
are abundant in singleton-based studies, are not a confound- 
ing source of variation in MZT-based epigenetic studies. 

The aim of our study was to examine whether diverse 
epigenetic profiles, specifically DNA methylation profiles, are 
associated with increased risk of MDD. We analysed genome- 
wide methylation patterns after bisulphite conversion of CpG 
sites using the lllumina Human Methylation (HM) 450 
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BeadChip (HM 450 BeadChip) of WBC-derived DNA from 
12 discordant MZT (6 male and 6 female) pairs and 
24 concordant MZT pairs. 

Materials and methods 

Sample selection/study design. Participants were drawn 
from the Queensland Twin Registry. Plienotypic information 
was collected as part of studies undertal<en at the Queens- 
land Institute of Medical Research each approved by the 
Institute's ethics committee. On the basis of the study, 
depressive symptoms were evaluated with either the Semi- 
Structured Assessment for the Genetics of Alcoholism, 
(adapted for use in Australia) or the Composite International 
Diagnostic Interview^^ questionnaires. A diagnosis of MDD 
was constructed from these questionnaires according to the 
Diagnostic and Statistical Manual of Mental Disorders-IV 
criteria.^"* A total of 17 958 individuals from 6855 families 
have completed psychiatric interviews, including a total of 
1812 MZT pairs, of whom 261 were found to be discordant 
for MDD. 

Six MZT female pairs (including five MZT female pairs 
previously epityped with the lllumina HM 27 BeadChip; 
lllumina, San Diego, CA, USA) and six MZT male pairs 
discordant for MDD were selected as 'case pairs'. Where 
possible, we selected case probands with (the more severe) 
recurrent MDD; all six MZT females and two of the six males 
met this criterion. Further, we aimed to select twins who were 
recorded as non-smokers and not alcohol or drug dependent, 
criteria that imposed severe restrictions on MZT selection, 
and so selection criteria were extended to include MZT that 
were matched for these criteria (for example, both smokers 
with similar daily consumption and age of initiation). Six MZT 
female pairs and six MZT male pairs concordant for no MDD 
and low neuroticism were selected as 'control pairs'. Neuroti- 
cism is a domain of personality easily measured by self-report. 
High neuroticism scores are associated with anxiety and 
depression; therefore, low scores are expected to represent 
those with a low liability for depression. This gives our study 
more power to detect variations in methylation status 
associated with MDD. Environmental factors such as smok- 
ing, alcohol or drug abuse are associated with altered 
methylation patterns. Each concordant MZT pair was 
selected to match as closely as possible to a discordant MZT 
pair on a number of key variables (Supplementary Tables 1 
and 2). Matching criteria were date of blood sample collection, 
(as methylation differences increase with age^°'^^) smoking 
and alcohol as well as previous history of drug use. A total of 
24 MZT pairs (48 individuals) were selected for our study 
(Table 1 ). All individuals were of Northern European ancestry. 



Bisulphite conversions. Bisulphite conversions of pre- 
viously extracted WBC-derived DNA from the 48 individuals, 
6 technical replicate samples and technical control samples 
(described below) were performed following the protocol and 
using reagents of the EZ DNA methylation Kit (Zymo 
Research, Irvine, CA, USA). Each conversion included 10 
samples and DNA recovery after conversion was quantified 
using Nanodrop (Thermo Scientific, Wilmington, DE, USA). 

In total, eight bisulphite conversions were undertaken, each 
comprising three to four MZT pairs, technical replicates 
and control samples. Later runs repeated eight MZT pairs 
that had generated <80% DNA recovery in earlier runs 
(Supplementary Table 3). 

Technical controls consisted of commercial DNA samples: 
Centre Etude Polymorphism Humain (CEPH) male 
(GM07029), CEPH female (GM06997) and FSK standard 
(Coriell Institute of Medical Research, Camden, NJ, USA). 
DNA recovery for the CEPH female sample after bisulphite 
conversion was ~ 50%; for this reason the CEPH male and 
the FSK standard were included in the later bisulphite 
conversions (Supplementary Table 3). Only FSK standard 
samples were included in the HM 450 BeadChips as controls 
because of their high percentage recovery of the bisulphite- 
converted DNA. 

Infinlum methylation assay. Bisulphite-converted DNA 
from 24 MZT pairs, 4 technical replicate samples and 4 
FSK standard controls were hybridised to the lllumina HM 
450 BeadChip, following the lllumina Infinlum HD methylation 
protocol and using reagents and conditions supplied by 
lllumina. The HM 450 BeadChip-assessed methylation 
status was interrogated at 485577 CpG sites across the 
genome. It provides coverage of 99% of RefSeq genes. 
Methylation scores for each CpG site are obtained as a ratio 
of the intensities of fluorescent signals and are represented 
as p-values. 

maxM 

B ~ value = -. ^. n — tt:^ 

' ( maxM + maxU + 1 00) 

where max = maximum intensity of fluorescence, M = 
methylated allele and U = unmethylated allele. The 100 in 
the denominator prevents division by 0 when both max M and 
max U are very small, but has little impact otherwise as max 
M + max U>1000 in >95% of sites.^^ p-values range 
between 0 and 1, where 0 means that no DNA molecules 
with a methylated CpG at the site of interest were detected in 
the sample, while 1 means that all DNA molecules had a 
methylated CpG site in the sample. 

Each MZT pair was placed on the same array and in 
neighbouring wells to avoid batch effects contributing to 



Table 1 Summary of cases and controls 



Cases 


Controls 


6 (5 + 1) MZT 5 pairs discordant for MDD 


6 MZT 2 pairs concordant for no MDD and low neuroticism 


6 MZT S pairs discordant for MDD 


6 MZT c? pairs concordant for no MDD and low neuroticism 


Total 24 MZT discordant for MDD 


Total 24 MZT concordant for no MDD and low neuroticism 



Abbreviations: MDD, major depressive disorder; MZT, monozygotic twin. 
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differences between pairs. A total of five HIVI 450 BeadCfiips 
were used for tfiis study. Two of tfie replicate samples were 
placed on the same array in non-adjacent wells from the 
corresponding MZT whereas the other two replicate samples 
were placed on different arrays (Supplementary Table 4). The 
FSK male and female samples were also duplicated and placed 
on different chips. A total of 56 samples were included in the 
arrays. Arrays were scanned with the lllumina iScan (Software 
Version 3.3.28) and the intensities of the images were extracted 
using the GenomeStudio (2010.3) methylation module (1.8.5) 
software. 

Quality control. Background was subtracted from our data 
using GenomeStudio (2010.3). Illumina's specification for the 
HM 450 BeadChip is 98% detection of GpG with P<0.05. 
All of our samples included in the five BeadChips exceeded 
specification, with >99% of GpG loci detected; therefore, no 
samples were excluded from the study. Three hundred 
seventy-three probes for which 75% of the samples had a 
detection P-value>0.05 were removed. A total of 485204 
probes remained. 

The p-values were transformed using a logit transformation 
in the IMA^° package in the R environment for statistical 
computing (version 2.13.2); http://www.R-project.org). Trans- 
forming via a logit transformation increases the validity of 
statistical tests of differential methylation.^^ Supplementary 
Figure 1 shows a histogram of the transformed |3-values 
across all samples. The peak correction method of Deuder- 
waerder et al.^'' was applied to correct for differences in 
results between the two chemistries utilised on the HIVI 450 
chip. QG plots produced by the IMA package are shown in 
Supplementary Figures 2-4. Unsupervised clustering of all 
samples showed that male and female samples clustered 
together. Furthermore, replicate samples and twin samples 
clustered together providing an indication of the accuracy of 
the methylation detection procedure. 

To improve the accuracy of our results, we excluded 
potential sources of technical and biological biases before 
analysing the methylation data. There is currently no standard 
procedure for filtering the samples and probes before 
methylation analysis. Our experimental design, including 
replicate samples enabled stringent quality control (QG) 
steps. These QG steps are described in detail in the 
Supplementary Methods. The number of sites annotated by 
probe type that were removed by the QG steps are shown in 
Table 2. After removing probes with large amounts of missing 
data and high levels of discordance between replicate pairs, a 
total of 462 002 (95.1%) probes remained for analysis. 

Statistical analysis. A two-sample Mest was used to test 
for differences in overall mean methylation between mono- 
zygotic cases and their unaffected co-twin. In order to test for 
differences in variance between cases and their unaffected 
co-twin, the variance for each probe was computed in cases 
and their unaffected co-twin separately, and the distributions 
were compared using the Wilcoxon signed-rank test. All 
statistical tests were implemented in R. 

Testing for differences between IVIZTs owing to IVIDD at 
each individual probe. In order to test for differences in 



Table 2 HM 450 BeadChip distribution of probes 


Annotation 


Before QC 


QC1 


QC2 


Island 


150254 


143391 


138738 


Shelf 


47161 


46447 


45691 


Shore 


112 072 


109736 


107 394 


NCpG 


176277 


172 695 


170178 


Total 


485557 


472269 


462 001 



Abbreviations: HIVI, human methylation; QC, quality control. 

QC1 represents remaining probes after QC1. QC2 represents remaining 

probes after QC2. 



methylation between monozygotic co-twins owing to MDD, a 
linear model was fitted. For each probe, the absolute 
difference between each twin pair was taken and regressed 
on matched twin set and case-control status. 

lyijki - yiii<2| = Sefi + MDDj + Sijk 

where, yijki = the residual transformed p-value of a probe 
after adjusting for batch and chip effects of the Ith (I = 1 ,2) 
twin of the jth twin pair (j = 1,2) from the ith set (1 = 1, ..,12). 
MDDj is an indicator such that MDDj = 1 for the discordant 
pairs and MDDj = 0 for the concordant pair eijk is the random 
error term. By taking the absolute difference in the residuals 
of the transformed p-value between MZT, the test can reflect 
both probes that are associated with more or less methyla- 
tion or probes that are more variably methylated probes 
associated with MDD. Under the MZT design we assume that 
any confounding variables are independent of the differences 
between MZT. 

To establish the significance of the top results, a set of 100 
permutations were performed. In each permutation, the 
concordant pair and discordant label were randomly assigned 
within each twin pair set, the analysis was performed for each 
probe, and the P-value for case-control status was extracted. 
The results of each permutation were extracted and ranked. 
The mean of the P-values with the same rank across all 100 
permutations was calculated and the distribution of the means 
was taken as the null distribution. An empirical P-value was 
calculated by counting the proportion of permutations that had 
at least one probe more significant than the top probe 
identified in the real data set. 

Pathway analysis. All genes for which at least one probe 
had a P-value < 0.001 for association with MDD status were 
uploaded to the commercial pathway analysis software 
Ingenuity Pathway Analysis (Ingenuity Systems, Redwood 
Gity, GA, USA) and the freely available software DAVID,^^ to 
test for enrichment of associations in functional pathways 
and gene ontology terms. 

Results 

Global differences in mean and variance. There was no 
significant difference in overall methylation between discor- 
dant pairs (P=0.56). As there were highly significant differ- 
ences in overall methylation of autosomal probes between 
males and females (P<2.2 x 10"^^; Supplementary 
Material) as previously reported by others, ^"^'^^ we tested 
for differences in overall methylation levels by sex within 
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discordant pairs. The difference in mean methylation was 
not significant (P=0.05), but was significant in females 
(P= 0.005), with cases being less methylated when com- 
pared with controls (whereas the trend of differences in 
males was the opposite). 

The difference in variance between cases and their control 
co-twins was highly significant (P<2.2 x 10"''^), with 52.4% 
of probes having higher variance in cases (binomial P-value 
2.2 X 10"^^). This increased variance in cases was consis- 
tent across all annotated probe types (Supplementary 
Table 6). The result was significant in males and females 
(both P<2.2x 10"^^). A jackknifing procedure where one 
twin pair was removed and the data reanalysed also gave 
significant results across all 12 replicates, indicating that the 
significant results are not attributable to one specific pair. 

The mean of the probe variances was 0.09 with s.d. 0.27 
with range 0.001-10.54. The distribution was highly skewed 
(Supplementary Figures 6a and b), with the majority of probes 
having a small variance compared with a much smaller 
number of probes that are highly variable. In order to test 
whether there were differences in the distribution of variances 
between cases and controls in probes with both high and low 
overall variance, we subdivided the probes into deciles based 
on their variance. We then performed the Wilcoxon signed- 



Table 3 Results from comparing distribution of variances in cases and controls 
by decile 



Decile 


Wilcoxon 
signed-rank test 
P-value 


Proportion of probes 
with increased 
variance in cases 


1 


<2.2x 10E-16 


0.545 


2 


<2.2x 10E-16 


0.539 


3 


<2.2x 10E-16 


0.539 


4 


<2.2x 10E-16 


0.530 


5 


<2.2x 10E-16 


0.531 


6 


<2.2x 10E-16 


0.522 


7 


<2.2x 10E-16 


0.519 


8 


6.50E-04 


0.508 


9 


8.79E-01 


0.501 


10 


1 .25E - 04 


0.504 



rank test for differences in variance in each of the probe sets. 
Across all of the deciles, more than half of the probes had a 
higher variance in cases than in controls. The differences in 
variance were significant in each of the deciles except for the 
ninth (Table 3). 

In order to test whether the differences in variance could be 
attributed to use of antidepressants, we tested for differences 
in variances between cases who had been prescribed 
antidepressants (n=6) and their co-twin control, and sepa- 
rately for cases who had not been prescribed antidepressants 
(n = 6) and their co-twin. Cases who had been prescribed 
antidepressants had increased variance at 51.5% of 
probes and the difference in variances was highly significant 
(P<2.2 X 10"^®). Cases that had not been prescribed 
antidepressants had increased variance relative to their 
unaffected co-twins at 51 .4% of probes and the difference in 
variances was highly significant (P<2.2 x 10"^®). 

MDD discordance at individual probes. The QQ plot for 
the genome-wide methylation analysis of concordant and 
discordant twins is shown in Supplementary Figure 5a. The 
observed distribution of P-values for the probes did not 
deviate from null distribution. The most significant probe was 
cgl 5083522 (P=1.38x 10"^, Table 4), found on chromo- 
some 10 in a CpG island upstream of the MAST4 gene. This 
probe was not significant when compared against the null 
distribution of all probes (P= 0. 1 1 ) or when compared with all 
probes in shores (P=0.06) (Supplementary Figures 5b-d). 
The MAST4 gene is expressed in a distinct pattern in white 
matter-containing regions of the brain in rodents. Other 
members of the MAST family have been implicated in 
dopaminergic signalling in dendrites.^'' However, this gene 
has not previously been linked to MDD or other mood 
disorders. 

Among the genes that harbour probes that are most 
significantly differentiated between discordant pairs, MAD1L1 
is the only gene that has been previously implicated in the 
aetiology of mood disorders. The 3rd most significant probe 
was located in the MAD1L1 gene (cg08985282, P=2.16 
X 10"^, Table 4). Recently, two independent genome-wide 



Table 4 Probes that are most significantly differentiated between discordant and concordant pairs (P< 1 0 



ILMNID 


CHR 


MAPINFO 


UCSC_REFGENE_NAME 


UCSC REFGENE 
ACCESSION 


RELATION TO 
UCSC_CPGJSLAND 


P-value 


cgl 0665379 


5 


65891878 


MAST4 


TSS1500 


Island 


1.38E-06 


cgl 72221 96 


1 


155057324 


EFNA3 


Body 


Island 


1.89E-05 


cg08985282 


7 


2089600 


MAD1L1 


Body 


Island 


2.16E-05 


cg25372357 


4 


56659770 


NA 


NA 


Island 


2.59E-05 


cg22864266 


12 


49319673 


FKBP11 


TSS1500 


S Shore 


2.68E-05 


cgl 0946423 


20 


44565633 


PCIF1 


5'UTR 


S_Shore 


2.98E-05 


cgl 184351 6 


1 


17023283 


ESPNP 


Body 


Island 


3.19E-05 


cg22374165 


2 


39199173 


LOCI 00271 71 5 


3'UTR 


NA 


3.65E-05 


cg22231758 


17 


69199815 


NA 


NA 


NA 


3.93E-05 


cgl 4852276 


3 


197684729 


IQCG 


5'UTR 


N_Shelf 


4.37E-05 


cg00270520 


2 


241579262 


NA 


NA 


N Shore 


4.54E-05 


cg06339265 


16 


11369615 


PRM2 


3'UTR 


N Shore 


5.76E-05 


cg041 45601 


20 


44641469 


MMP9 


Body 


N Shore 


5.98E-05 


cg09858767 


1 


14742375 


NA 


NA 


NA 


8.21 E- 05 


cg22618878 


6 


27156656 


NA 


NA 


NA 


8.28E-05 


cg07048516 


22 


24105087 


C22orf15 


TSS200 


NA 


9.08E-05 


cg03666973 


2 


27008764 


CENPA 


TSS200 


Island 


9.48E-05 
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significant hits for schizoplirenia in this gene were found, 
and evidence of association with bipolar disorder was found in 
a recent meta-analysis (rs4332037, P=6.3 x 10"'').^^ 

A total of 21 1 genes were uploaded to Ingenuity and DAVID 
pathway analysis packages. Neither pathway analysis pack- 
age identified any canonical pathways or biological functions 
to be significantly enriched after setting a false discovery 
rate of 5%. 

Discussion 

Previous studies involving MZT pairs discordant for psychia- 
tric disorders have been limited to a small number of twin pairs 
(for example, two MZT pairs for schizophrenia'*°), or have 
assessed only a fraction of the genome-wide CpG sites. ''^■''^ 
This study represents the largest twin study to date of 
genome-wide associated methylation differences in MZT 
pairs discordant for MDD. Furthermore, this is one of the first 
methylation studies in psychiatry that utilises the HM 450 
Beadchip, which provides coverage of a much larger number 
of CpG sites throughout the genome. 

Our analyses included sets of discordant MZT pairs and its 
matched concordant pairs. Although our sample size was not 
sufficient to identify methylation differences that could be 
attributed to MDD, it revealed that cases show a highly 
significantly increased variation in methylation throughout the 
genome (P<2.2x 10"^®) when compared with their control 
co-twins. This finding suggests that the methylome in patients 
with MDD may be undergoing more changes as a result of 
environmental influences than that in controls as a result of a 
loss of epigenetic stability. Increased variability of methylation 
in cases is seen across many cancers,"*^ but has not been 
previously investigated in studies of psychiatric disorders. 
One potential reason for this increased variance in cases may 
be due to various forms of treatment including the use of 
medications such as antidepressants by affected twins that 
are not used by the unaffected twins. However, the result 
remained significant when comparing those reporting anti- 
depressant use with their co-twins and also in those reporting 
no antidepressant use. No information was collected on the 
length of time that the drugs were used for or the dosage that 
was prescribed. Antidepressant drugs are known to induce 
epigenetic changes in the brain"*^ and may also do so in 
blood samples. 

Our results also showed that female cases had significantly 
increased methylation throughout the genome when com- 
pared with their unaffected co-twin (P= 0.005). This signifi- 
cant difference was not detected when comparing male cases 
with their unaffected co-twins or when analysing males and 
females together. We detected distinct patterns of methylation 
in male and female twins across a large number of autosomal 
probes, which suggests that analysing methylation differ- 
ences across the genome by sex is appropriate. 

There are a number of important limitations of the design 
and analysis of this study. First, the tissue available for 
methylation analysis in this study imposes limitations. 
Methylation levels are known to vary among tissues. Brain 
tissue would be the tissue of choice for the study of psychiatric 
disorders. However, biopsy is not an option and there is no 
available collection of post-mortem MZT brain tissue 



discordant for psychiatric disorders. For epigenetic studies 
of disorders related to neurodevelopment, it has been argued 
that cells that are derived from the same embryonic origin as 
the cells involved in the disease should be studied;^ for 
example, buccal cells have the same embryonic origin as 
brain cells. This argument is relevant for psychiatric disorders 
like schizophrenia where interest may focus on neurodeve- 
lopment. However, if environmental stresses influencing the 
liability of MDD occur after the early stages of development 
(for example, childhood trauma), then cells from the same 
embryonic origin as brain tissue (that is, buccal cells) seem no 
more relevant than WBC. Reliance on DMA from WBC means 
that the inferences that can be made regarding pathways 
involving the brain are limited. However, disease-related 
pathways can still be identified from peripheral tissues."*^ DNA 
methylation differences identified using peripheral tissue may 
not be explanatory or causal of changes in gene expression 
taking place in the brain, but they could be used as a 
diagnostic tool. 

A second important limitation of our study is the small 
sample size. The discordant MZT model is considered to be 
powerful for detecting epigenetic differences^^ because the 
cases and controls are matched on their genetic complement 
eliminating this as a confounding variable. In order to 
eliminate confounding environmental risk factors, we imposed 
stringent criteria on the selection of MZT. Only a small number 
of MZT pairs fulfilled these criteria despite selection from one 
of the largest twin collections worldwide. Larger samples 
can only be achieved through international consortia or 
through relaxation of the selection criteria. 

A third limitation of our study was the variability between 
samples in factors that may affect methylation. For example, 
first, the age of the subjects ranged from 31 to 63 (but was not 
significantly different between the sexes). Second, although it 
is estimated that 96% of the participants in the Queensland 
Institute of Medical Research studies have North European 
ancestry, '^'^ ancestry differences between twin pairs cannot be 
excluded. Third, environmental factors (that is, alcohol abuse 
and smoking) are associated with altered DNA methylation 
patterns. We matched pairs as closely as possible; however, 
some of the individuals in our sample were smokers or alcohol 
dependent. These examples show confounding factors, which 
may influence our results especially because our sample size 
is small. Future studies would ideally control for these and 
other confounding factors. 

Another limitation of our design of this study is that blood 
samples were collected after the onset of depression. There- 
fore, any differences in methylation associated between 
MZTs associated with MDD may reflect a range of factors 
including treatment. A prospective study design, in which 
samples are taken before first onset of MDD and then after 
disease onset, but before treatment would aid in the 
identification of epigenetic biomarkers associated with MDD. 
Further collection of samples collected during and after 
treatment may also help implicate sites that show the greatest 
changes in methylation after successful treatment of the 
disorder. 

The genetic architecture of complex traits is very hetero- 
geneous. Genome-wide association studies have been 
successful in identifying susceptibility variants for complex 
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traits."*^ However, most of the variants identified liave very 
small effect sizes. It is lil<ely tliat DNA metliylation differences 
involved in complex psychiatric disorders will also have small 
effects, and that sample sizes commensurate with those 
utilised in genome-wide association studies may be required 
to detect methylation differences at individual CpG islands. 
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